Computer Reliability 



omputer programs de- 
veloped by NASA and 
other government agencies 
have many potential applica- 
tions other than their origi- 
nal purposes. To help com- 
puter users reduce 
automation costs by taking 
advantage of this resource, 
NASA’s Computer Software 
Management and Informa- 
tion Center (COSMIC) col- 
lects, screens, stores and 
distributes programs that 
have secondary applicabil- 
ity. For the most part, 
COSMIC’s customers are in- 
dustrial firms and the appli- 
cations are in industrial re- 
search and development or 
manufacturing technology. 
But often COSMIC finds un- 
usual applications in such 
fields as medicine, meteor- 
ology or education. An ex- 
ample is the use of a NASA- 
developed program by Dr. J. 
Walter Bond at California 
State University, Dominquez 
Hills, who is creating a 
course in computer reliabil- 
ity modeling. 

The course will examine 
three different computer 
programs, one of them 
NASA’s CARE III, the others 
UCLA’s ARIES 78 and ARIES 
82. All three programs are 
designed to help estimate 


the reliability of complex, 
redundant, fault-tolerant sys- 
tems. In computer design, 
software of this kind can 
predict — or “model” — the 
effects of various hardware 
or software failures, a pro- 
cess called reliability 
modeling. 

The three programs have 
strong similarities, but they 
also have many differences. 
By assigning the same set 
of problems to each of the 
programs and analyzing the 
differences, students will be 
better able to understand 
the subtleties of program- 
ming. They will then investi 
gate the “dynamic model- 
ing” approach to computer 
reliability analysis, employ- 
ing failure simulation tech- 
niques to assess the behav- 
ior of an entire system. This 
approach enables diagnosis 
of the cause of a computer 
failure by pinpointing the 
effects of the failure of vari- 


ous components, and it al- 
lows students to see where 
a problem is statistically 
expected to arise. 

Dr. Bond’s course is not 
purely theoretical. Eventu- 
ally, the students’ work will 
be incorporated into a soft- 
ware package that addresses 
the redundancy manage- 
ment of a computer system 
based on VLSI (Very Large 
Scale Integration) or VHSIC 
(Very High Speed Inte- 
grated Circuits). The under- 
lying aim is development of 
advanced reliability model- 
ing tools that would help 
assess future failure rates of 
sealed chip devices; this can 
contribute to a new genera- 
tion of ultrareliable comput- 
ers by allowing action to be 
taken during the design 
phase to provide extra re- 
dundancy where needed, a 
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